Memory Fault Tolerance Software Mechanisms: Design and Configuration Support through SWN Models
نویسندگان
چکیده
This paper presents a case study of a software fault tolerance mechanisms, the distributed memory, designed and implemented within the european projects TIRAN and DEPAUDE, and currently under study within the Italian project ISIDE. The studied mechanisms are part of a complete framework of general purpose software fault tolerance mechanisms. In this paper we show a method for the compositional construction of models of the DM and of the environment in which it operates, expressed in the Stochastic Well Formed Nets (SWN) formalism. Different versions of submodels, at different detail level are presented and compared using some behaviour inheritance notions taken from the literature.
منابع مشابه
Validation and Evaluation of a Software Solution for Fault Tolerant Distributed Synchronization
This paper presents a case study on the combined use of different tools and techniques for the validation and evaluation, from the early stages of the design, of a fault tolerant software mechanism named distributed synchronization The mechanism has been specified using UML state charts and sequence diagrams. A number of Stochastic Well-formed Nets (SWN) models have been derived from the specif...
متن کاملA Survey on Fault Tolerance in Work flow Management and Scheduling
Fault Tolerance is a configuration that prevent a computer or network device from failing in the event of unexpected problem or error such as hardware failure, link failure, unauthorized access, variations in the configuration of different systems and system running out of memory or disk space. The integration of fault tolerance measures with scheduling gains much importance. Workflow managemen...
متن کاملFault Tolerant Memory Design for HW/SW Co-Reliability in Massively Parallel Computing Systems
A highly dependable embedded fault-tolerant memory architecture for high performance massively parallel computing applications and its dependability assurance techniques are proposed and discussed in this paper. The proposed fault tolerant memory provides two distinctive repair mechanisms: the permanent laser redundancy reconfiguration during the wafer probe stage in the factory to enhance its ...
متن کاملAutomatic Generation of Fault-Tolerant CORBA-Services
The Common Object Request Broker Architecture (CORBA) is the most successful representative of an object-based distributed computing architecture. Although CORBA simplifies the implementation of complex, distributed systems significantly, the support of techniques for reliable, fault-tolerant software, such as group communication protocols or replication is very limited in the state-of-the-art ...
متن کاملFast Checkpoint/Recovery to Support Kilo-Instruction Speculation and Hardware Fault Tolerance
The increased relative cost of accessing memory is encouraging processor designers to explore deeper uniprocessor speculation (e.g., with branch and value prediction) and consider multiprocessor speculation (e.g., on coherence message types and values). While some mechanisms have been proposed to support deep speculation using speculative multithreading, current mechanisms for conventional proc...
متن کامل